Scaling of the distribution of shortest paths in percolation 
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We present a scaling hypothesis for the distribution function of the shortest paths connecting 
any two points on a percolating cluster which accounts for (i) the effect of the finite size of the 
system, and (ii) the dependence of this distribution on the site occupancy probability p. We test 
the hypothesis for the case of two-dimensional percolation. 
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The chemical distance or minimal path, £, between two sites is defined as the shortest path on a percolating cluster 
connecting the two sites. The quantity of interest here is the conditional probability, P{£\r), that two sites taken 
from the same cluster, separated by geometrical distance r, are £ chemical distance away. For example, in oil recovery 
the first passage time from the injection well to a production well a distance r away is related to P{£\r). In many 
realistic problems where the disordered media controls a transport process, dynamic properties such as conductivity 
and diffusion can also be expressed in terms of chemical distance, in the case of loopless aggregates or aggregates 
for which loops can be neglected . It is known that the average chemical distance £ scales as r'^min ^ where various 
estimates of dmin are dmin ~ 1.130 ±0.005 (|] and dmin ~ 1.1307 ±0.0004 [||. There has been an extensive theoretical 
and computer work done on studying the scaling of P{£\r) but the complete scaling form of P{£\r) which 

accounts for the finite size effect, and off-critical behavior has not been studied in detail. 

We start by recalling the scaling form for a different but related function, the conditional probability P{r\£), the 
probability that two sites, separated by chemical distance £, are a geometric distance r away. For isotropic media 
this probability distribution was studied extensively (see ||l|,^-^). In analogy with the theory of self- avoiding random 
walks (SAWs) ||-|ll|l, it was proposed ^ that 

Pir\£) = A[^yfo{^), (1) 

where At l/£'^, v = 1/dmin = 0.88 ± 0.02, gr = 2.2 ± 0.3 ||l|l and the scaling function /o(a;) = exp(-a2:^) with 
5 = {1 — iy)^^ . The relation between exponent gr and other known exponents was proposed by Havlin ]T^ , who used 
arguments similar to those of de Gennes |^ for SAWs. The distribution P{r\£)r=i determines the probability of the 
minimal path returns to a nearest neighbour site of the origin, which is from Eq. (|l]) 

I / l\9r ^ 

(2) 



On the other hand, the total number of clusters N{£) of chemical size £ scales as M^'^^^{dM/d£), where M is the total 
mass of the cluster, which scales as £'^'^f , where rf/ is the fractal dimension of the percolation cluster, and t ~ 1 + d/d/ 
(see e.g. ||l[). Hence 

N{£) ~ £i>df{-T+l)+i:'df~l ^ j^udf[-T+2)-l ^ ^g-j 

Due to the fact that in the mean field regime the effect of the excluded volume effect is absent, we conjecture]^ that 
the number of clusters No{£) in which the minimum path £ returns to a nearest neighbour site of the origin obeys the 
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^This conjecture is made in analogy with the number of self-avoiding random walks coming to the origin, which scaling lacks 
prefactor N~'~^, i. e. 7 = 1 as in mean field regime (see [^). 
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mean field values for which v = 1/2, r = 5/2, and df = 4. The probability of a cluster with the path returning to 
a nearest neighbour site of the origin is proportional to the fraction of paths No{£)/N{£) which return to a nearest 
neighbour site of the origin and the probability of an end of the path in a volume V to be at the specific site l/V. 
Thus, 



No{i) 1 



N{£) V' 

where V is the volume of the cluster V i'^'^ . The expressions (0) and 



should be equal and thus 



1 



1 



Finally we have 



£u+ugr £i^df(-T+2) + l+di> ' 



df{—T + 2) + dmin + d - 1 = df + d„ 



(4) 



(5) 



(6) 



This value is within the error bars found numerically for gr in d = 2 and d = 3 Jljl- Numerical simulations for 
percolation clusters in c? = 2 are in good agreement with the above scaling form |]l|,^Jl5 . 

We start by recalling a scaling Ansatz for P{£\r), which has been developed in m-m- Exactly a.t p — Pc in the 
infinite system (L = oo), in analogy to (|l|). 



Pie\r) = Ar 



-ge 



h 



£ 



where fi{x) ~ exp(— aa; '^^) is the scaling function corresponding to /o and Ar 
The conditional probabilities P{r\£) and P{£\r) are related by 

P{r\£)P{£) = P{£\r)P{r) = P(r,£) , 



1 

r'^min 



(7) 



is the normalization factor. 



(8) 



where P{r) = / P(r, £)d£ and P{£) = J P{r, £)dr. For the case of the infinite percolation cluster, i. e. when we restrict 
ourself only to the points on the infinite cluster, P{r) and P{£) scale as [Esj 



Pi£) 



l)d,-l 



P{r) 



where di — df/dmin- Thus, 



fdf-l 

P{£\r) ^ P{r\£)^^ 



Substituting Eqs. (|l|) and (0) into Eq. @, we find that 

fi{x) - exp(-aa;~'^i) 

where 

(l)^^Si> = i>{l - D)-^ = l/{drr 



1), 



and m 



ge -1 ^ (.g-r - + (2 - df)^. 



(9) 
(10) 

(11) 
(12) 

(13) 



Using Eq. (||) we find ge — 2 for all c? > 2. Note, that the numerical value for ge w 2.04 found here is very close to 
this prediction. 

Let us now consider the scaling of P'{£\r), which is defined in the same way as P{£\r), but for any two randomly 
chosen points, separated by geometrical distance r and not necessarily belonging to an infinite cluster. In this case. 



P'(r) - rG{r) , 



(14) 



where G{r) is a pair connectedness function and the factor r comes from the summation over all points which are 
equidistant from the origin. The scahng of the G(r) is described in fl^. At p = p^. 
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G{r) - r^-'^-" , 



(15) 



where 77 = 4 — 2df. The exact value of 77 = 5/24 « 0.21 is known for d = 2. Substitution of Eqs. (0) and (^4|) into 
Eq. (0), and norniahzation, yields 



7" mill \ 7" mill 



/i 



1 zr„ 



h 



1 Zd:, 



The integration of P' (r, £) over r gives 

P'{^) - ^2£>-l-£>,, ^ 

Next, we substitute Eqs. (|l|) and (0) to Eq. (|) and compare with (|l|). We obtain 



P'W) = Ar 



h 



where 



Thus from (|3D and (|19|), 



{gr - l)j> + 2(2 - d/)i>. 



91 = (2-d/)i 



(16) 



(17) 



(18) 



(19) 



(20) 



The latter relation between exponents g'^ and g^ has simple probabilistic meaning, since a pair of two randomly selected 
points separated by a chemical distance £ should belong to the cluster of chemical size > I. The probability of 
this event scales as £~'''^/^. Once two points belong to such a cluster, the probability that their chemical distance 
is equal to i scales the same way as on an infinite cluster and is proportional to £~^'. Hence the probability that 
two randomly selected points are separated by a chemical distance I is proportional to the product of these two 
probabilities i-at-vvj^ ^ which, by definition, is . Hence g'i = gi + r]h'/2^ We summarize the values of the above 
exponents for d = 2 in Table I. 

Next we study the finite size effect and off-critical behavior using Eq. (Q) as starting point. We propose that for 
f > r, P'{£\r) has the following scaling form 



P'{i\r) 



1 



-9 1 



h 



h 



I 



h 



(21) 



where the scaling functions are /i(x) = exp(— aa; '^^), f2{x) = exp{—bx'^^) and /3(a;) — exp{—cx). Here ^ is a 
characteristic length of pair connectedness function and has a power-law dependence of occupancy p as 



\P-Pc 



(22) 



The first function /i accounts for the lower cut-off due to the constraint £ > r, while /2 and /a account for the upper 
cut-off due to the finite size effect and due to the finite correlation length respectively. Either /2 and /a becomes 
irrelevant, depending on which of the two values i or ^ is greater. For L < f2 dominates the upper cut-off, otherwise 
/a dominates. We assume the independence of the finite size effect and the effect of the concentration of the vacant 
sites, so the Eq. ( pl| ) can be represented as a product of the terms which are responsible for the finite size effect (/2) 
and the effect of the concentration (/a). Our simulations confirm this assumption numerically. 

There is no particular reason for the choice of the simple exponential decay of the function /a, but we find that this 
function is a straight line on a semilogarithmic plot (see discussion below). However, due to the fact that the pair 



connectedness function G{r) ~ ''exp(— r/^) has simple exponential form |16| it is reasonable to assume that 



■^Indeed, riu/2 = Pv/v, where {3 is defined as the exponent of the probability to belong to the infinite cluster. Poo ~ \p — pc\^ . 
Analogous relation is known, for example, for the exponents 7 and l/cr, which define respectively the scaling of the average 
and the typical size of the cluster: l/cr = 7 -f /3. 
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function /s has a form of a stretched exponential /a exp{—cx'^). However, since the exponent iy « 0.88 is close to 
1, we are not able to resolve this question in our simulations. 

Next, using the Leath method [pT|-p^, which corresponds to calculation of P'{£, r) ^ we numerically test the sca ling 
conjecture Eq. ( pT| ) exactly at the percolation threshold p = Pc- In this case, — oo so — const. Hence Eq. (|2l| ) 
reduces to 



p'ie\r) 



-9t 



h 



h 



(23) 



Indeed, Fig. la shows that P'{i\r) has power law behavior for 



<£<L'' 



and for £ > L'^min, In order to test Eq. (|3|) we compute rescaled probability distribution 

$ ( = P'{£\r)iair-'^miniai'i) = . f^^ 

\ 7*"inin J 

and plot it against scaling variable x = £/r'^min (see Fig. lb). According to Eq. (|2^), 



and rapidly vanishes for £ < r'^n 



$(x) = A/i(a;)/2 



(24) 



(25) 



Therefore, ^{x) should depend only on x and the ratio L/r. Indeed, Fig. lb shows excellent data collapse for L/r — 16, 
with sharp cutoffs governed for small x < 1 by fi{x) and for large x > (L/r)'^min by /2[a;(r/L)''min]. 

In order to test the assumption that the functions /i and /2 are stretched exponentials with exponents 0i and 
(j)2 we plot Il{x) — \og[A/^{x)] versus x in double logarithmic scale for various values of normalization constant A. 
(See Fig. 2.) If the stretched exponential conjecture is correct, H(x) should have two straight line asymptotes for 
logx oo with the slope (j)2 and for logx — > — oo with the slope The slopes (j)! and 02 of the straight line fits 

depend weakly on the value of A. Using A = 1.65, we obtain the longest regimes of straight line behavior. For this A 
we obtained 0i ~ 7.3 and (f>2 ~ 4.0. The value of is in good agreement with Eq. (p^ , while 02 seems to be a new 
exponent. 

Finally, in order to test the dependence of P'{£\r) on p we obtain data for very large system size L and for several 
values of p ^ Pc- In this case, the upper cutoff of the distribution Eq. (|2j) is governed by and the functional form 
of the rescaled probability <& is given by 



For large £, we suggest an exponential decay of $: 



$(£/r''min) ~ exp 



£ 



(26) 



(27) 



Indeed, semi-logarithmic plots of log $(^/r'*min) versus £ shown in Fig . 3a can be approximated by the straight lines 
with slopes which approach to zero as p — > pc- According to Eq. (p^), these slopes k{p) should be proportional to 
^^''min = |p_p^|<imin'^ ^ pjg^ 3b shows doublc logarithmic plots of | A: (p) | versus |p — Pc| for p < pc and for 

p > Pc, which can be well approximated by straight lines with slopes 1.55 and 1.57 in good agreement with scaling 
conjecture, Eq. (^. 

The scaling form, Eq. (pT|) , is limited to the case when ^ > r. For S, < r, the finite size effects can be neglected, the 
power law regime vanishes, and the minimal path can be divided into r/^ independent blobs each of length ^''min^ so 
that the distribution P'{£\r) approaches Gaussian form with mean r^'^min^i and variance r^^rfmin^i. 



^ In the Leath method, starting a cluster from the origin corresponds to selection of a random point in the disordered media. 
Construction of the Leath cluster corresponds to the determination of all the points which are connected to the starting one. 
In our procedure, we compute P'(£,r) by dividing the number of the pair of points A'^.^ which are separated by r and £ by 
the total number of points A*' in all constructed clusters. The quantity of interest Ni^r/N is equivalent to the quantity P'{£,r), 
which by definition is the probability that two randomly chosen points, belonging to the same cluster, are separated by r and 
£. 
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In many realistic problems an interesting quantity is the first passage time distribution. If we assume that the first 
passage time t between two points is linearly proportional to shortest chemical distance £ between them then we can 
identify the distribution P'{i\r) as a first passage time distribution. This is only true when the spreading velocity 
through the media remains constant along the path. In the oil recovery problem it is the injection rate, the volume 
of the fluid pumped into the media per unit time, which is maintained constant, not the velocity itself. We can 
generalize the relation between first passage time and chemical distance hy t ^ For the given fractal dimension ds 
of the backbone of the percolating cluster we know that M(r) ~ r*^^ where M(r) is the total mass of the backbone of 
the percolation cluster of radius r. Constant pumping rate gives dt ^ dM ~ r'^'^~^dr. Combining with the relation 
£ r'^min and their derivative d£ ~ r'^min'^dr we can get another scaling relation z = ds/dmin = 1.625/1.14 w 1.43. 
If we substitute £ ~ f^/^ in Eq. (|||) it gives the correct scaling form for the first passage time distribution in case of 
constant injection rate. 

Another problem comes from the presence of a second well, which might establish some pressure potential field 
through the media and bring some anisotropic spreading front toward the production well when we apply some external 
driving pressure. This can be corrected by calculating the potential field which is a solution of Laplace's equation with 
two point boundaries and calculating the local velocity on the percolating cluster including the quenched disorder of 
the percolation cluster. Practically the problem of multiple wells scattered along the oil fields with equal spacing is 
of particular interest in oil recovery. 

In summary, we have studied the scaling properties of the shortest paths distribution for fixed two points on the 
percolating cluster which accounts for the finite size effect, also off criticality. We propose a plausible scaling hypothesis 
for the distribution, which is supported by theoretical argument and tested by numerical simulation. The lower and 
upper cut-offs of the distribution has been numerically observed and fitted successfully by stretched exponential 
function. Off the critical point the upper cutoff, due to the finite correlation length, becomes a pure exponential form. 
Further work is underway to test the conjectures and hypothesis developed here. 

We thank Prof. D. Stauffer for fruitful discussion, the referee for the helpful suggestions, and BP for the financial 
support. 
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0.14 


1.33 


0.20 


1.13 


0.88 


2.2 


2.04 


2.14 


7.3 


4.0 



Table I: Numerical estimates for the various exponents. 
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FIG. 1. (a) Log-log plot of P'{£\r) for p — pc = 0.593 and for different sets of parameters, {r,L). The straight line regime 
has slope g'l = 2.14. (b) Log-log plot of rescaled probability "I>(a::) = P'(£|r)i!®f r~'^min(Sf~i) against rescaled length x — ^/r'^min 
using the values, g'l = 2.14 and dmin = 1.14. 




FIG. 2. Log-log plot of transformed probability n(a::) = log[A/${x)] versus x — ^/r'^min. The slopes of the solid lines give 
the power of the stretched exponential function /i and /2 in Eq. (p5[). Using the parameter A — 1.65, the slopes give ~ 7.3 
for the lower cut-off and (j)2 ~ 4.0 for the upper cut-off. 
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FIG. 3. Semi- logarithmic plot of transformed probability $ versus I {r = 32) shows pure exponential behavior of /s. (a) 
(O) denotes p = 0.613, (□) p = 0.633, (O) p = 0.655 and (x) p = 0.673. (b) (O) denotes p = 0.533, (□) p = 0.553, (O) 
p = 0.573 and (x) p = 0.583. (c) The slope of the log-log plot of the coefficient in exponential function fs as a function of 
\p — pc\ gives the value t'dmin 1-55 for p > pc and 1.57 for p < pc. 
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